(57) Abstract

The present invention relates to fluorescent proteins derived from Green Fluorescent Protein (GFP) or any functional analogue thereof, 
wherein the amino acid in position 1 preceding the chromophore has been mutated to provide an increase of fluorescence intensity.

Novel variants of green fluorescent protein, GFP.

FIELD OF THE INVENTION 

The present invention relates to novel variants of the fluorescent 
protein GFP having improved fluorescence properties. 

BACKGROUND OF THE INVENTION

The discovery that Green Fluorescent Protein (GFP) from the
jellyfish A. victoria retains its fluorescent properties when
expressed in heterologous cells has provided biological research
with a new, unique and powerful tool (Chalfie et al. (1994).
Science 263:802; Prasher (1995) Trends in Genetics 11:320; WO
95/07463).

Furthermore, the discovery of a blue fluorescent variant of GFP
(Heim et al. (1994). Proc. Natl. Acad. Sci. 91:12501) has greatly
increased the potential applications of using fluorescent
recombinant probes to monitor cellular events or functions, since
the availability of probes having different excitation and
emission spectra permits simultaneous monitoring of more than one
process.

However, the blue fluorescing variant described by Heim et al.,
Y66H-GFP, suffers from certain limitations: the blue fluorescence
is weak (emission maximum at 448 nm), thus making detection
difficult and necessitating prolonged excitation of cells
expressing Y66H-GFP. Moreover, the prolonged period of excitation
is damaging to cells, especially because the excitation wavelength
is in the UV range, 360 nm - 390 nm.

A very important aspect of using recombinant, fluorescent proteins
in studying cellular functions is the non-invasive nature of the
assay. This allows detection of cellular events in intact, living
cells. A limitation with current fluorescent proteins is, however,
that relatively high intensity light sources are needed for
visualization. Especially with the blue variant, Y66H-GFP, it is
necessary to excite with intensities that are damaging to most
cells. It is worth mentioning that some cellular events, like
oscillations in intracellular signalling systems, e.g. cytosolic
free calcium, are very photosensitive. A further consequence of
the low light emittance is that only high levels of expression can
be detected. Obtaining such high level expression may stress the
transcriptional and/or translational machinery of the cells.

The excitation spectrum of the green fluorescent protein from
Aequorea victoria shows two peaks: a major peak at 396 nm, which
is in the potentially cell damaging UV range, and a lesser peak
at 475 nm, which is in an excitation range that is much less
harmful to cells. Heim et al. (1995), Nature, Vol. 373, pp. 663-4,
discloses a Ser65Thr mutation of GFP (S65T) having longer
wavelengths of excitation and emission, 490 nm and 510 nm,
respectively, than the wild-type GFP and wherein the fluorophore
formation proceeded about fourfold more rapidly than in the wild-
type GFP.

Expression of GFP or its fluorescent variants in living cells
provides a valuable tool for studying cellular events and it is
well known that many cells, including mammalian cells, are
incubated at approximately 37°C in order to secure optimal and/or
physiologically relevant growth. Cell lines originating from
different organisms or tissues may have different relevant
temperatures ranging from about 35°C for fibroblasts to about 38°C
- 39°C for mouse β-cells. Experience has shown, however, that the
fluorescent signal from cells expressing GFP is weak or absent
when said cells are incubated at temperatures above room
temperature, cf. Webb, C.D. et al., Journal of Bacteriology, Oct.
1995, p. 5906-5911, Ogawa H. et al., Proc. Natl. Acad. Sci. USA,
Vol. 92, pp. 11899-11903, December 1995, and Lim et al., J.
Biochem. 118, 13-17 (1995). The improved fluorescent variant S65T
described by Heim et al. (1995) supra also displays very low
fluorescence when incubated under normal culture conditions
(37°C), cf. Kaether and Gerdes, FEBS Letters 369 (1995) pp. 267-
271. Many experiments involving the study of cell metabolism are
dependent on the possibility of incubating the cells at
physiologically relevant temperatures, i.e. temperatures at about
37°C.

SUMMARY OF THE INVENTION 

The purpose of the present invention is to provide novel
fluorescent proteins, such as F64L-GFP, F64L-Y66H-GFP and F64L-
S65T-GFP, that result in a cellular fluorescence far exceeding the
cellular fluorescence from cells expressing the parent proteins,
i.e. GFP, the blue variant Y66H-GFP and the S65T-GFP variant,
respectively. This greatly improves the usefulness of fluorescent
proteins in studying cellular functions in living cells.

A further purpose of the invention is to provide novel fluorescent
proteins that exhibit high fluorescence in cells expressing them
when said cells are incubated at a temperature of 30°C or above,
preferably at a temperature of from 32°C to 39°C, more preferably
at a temperature of from 35°C to 38°C, and most preferably at a
temperature of about 37°C.

It is known that fluorescence in wild-type GFP is due to the
presence of a chromophore, which is generated by cyclisation and
oxidation of the SYG at position 65-67 in the predicted primary
amino acid sequence and presumably by the same reasoning of the
SHG sequence and other GFP analogues at positions 65-67, cf. Heim
et al. (1994). Surprisingly, we have found that a mutation,
preferably a substitution, of the F amino acid residue at position
1 preceding the S of the SYG or SHG chromophore or the T of the
THG chromophore, in casu position 64 in the predicted primary
amino acid sequence, results in a substantial increase of
fluorescence intensity apparently without shifting the excitation
and emission wavelengths. This increase is remarkable for the blue
variant Y66H-GFP, which hitherto has not been useful in biological
systems because of its weak fluorescence.
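The substitution notation used here (e.g. F64L: phenylalanine at position 64 replaced by leucine) can be made concrete with a minimal Python sketch. The helper name `apply_substitution` and the placeholder sequence are illustrative assumptions; the placeholder is not the GFP sequence, it only puts F, S, Y and G at positions 64 to 67 as the text describes.

```python
import re


def apply_substitution(sequence: str, mutation: str) -> str:
    """Apply a point substitution such as 'F64L' (1-based position).

    Hypothetical helper for illustration; it checks that the residue
    named in the mutation is really present before replacing it.
    """
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"unrecognised mutation: {mutation!r}")
    old, new = match.group(1), match.group(3)
    index = int(match.group(2)) - 1  # convert to 0-based index
    if not 0 <= index < len(sequence):
        raise ValueError(f"position out of range in {mutation!r}")
    if sequence[index] != old:
        raise ValueError(
            f"expected {old} at position {index + 1}, found {sequence[index]}"
        )
    return sequence[:index] + new + sequence[index + 1:]


# Placeholder sequence (NOT real GFP): 63 filler residues, then F64 and
# the SYG tripeptide at positions 65-67, then three more fillers.
placeholder = "X" * 63 + "FSYG" + "XXX"

blue_variant = apply_substitution(placeholder, "Y66H")   # SYG -> SHG
improved_blue = apply_substitution(blue_variant, "F64L")  # F64 -> L64
print(improved_blue[63:67])  # residues 64-67
```

Checking the original residue before substituting guards against off-by-one errors when a sequence is numbered differently from the predicted primary sequence referred to in the text.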

The F64L, F64I, F64V, F64A, and F64G substitutions are preferred,
the F64L substitution being most preferred, but other mutations,
e.g. deletions, insertions, or posttranslational modifications
immediately preceding the chromophore are also included in the
invention, provided that they result in improved fluorescence
properties of the various fluorescent proteins. It should be noted
that extensive deletions may result in loss of the fluorescent
properties of GFP. It has been shown that only one residue can
be sacrificed from the amino terminus and less than 10 or 15 from
the carboxyl terminus before fluorescence is lost, cf. Cubitt et
al. TIBS Vol. 20 (11), pp. 448-456, November 1995.

Accordingly, one aspect of the present invention relates to a
fluorescent protein derived from Aequorea Green Fluorescent
Protein (GFP) or any functional analogue thereof, wherein the
amino acid in position 1 upstream from the chromophore has been
mutated to provide an increase of fluorescence intensity when the
fluorescent protein of the invention is expressed in cells.
Surprisingly, said mutation also results in a significant increase
of the intensity of the fluorescent signal from cells expressing
the mutated GFP and incubated at 30°C or above, preferably
at about 37°C, compared to the prior art GFP variants.

There are several advantages of the proteins of the invention,
including:

Excitation with low energy light sources. Due to the high degree
of brightness of F64L-Y66H-GFP and F64L-GFP their emitted light
can be detected even after excitation with low energy light
sources. Thereby it is possible to study cellular phenomena, such
as oscillations in intracellular signalling systems, that are
sensitive to light induced damage. As the intensity of the emitted
light from the novel blue and green emitting fluorescent proteins
is of the same magnitude, it is possible to visualize them
simultaneously using the same light source.

A real time reporter for gene expression in living cells is now
possible, since the fluorescence from F64L-Y66H-GFP and F64L-GFP
reaches a detectable level much faster than from wild type GFP
and prior known derivatives thereof. Hence, they are more suitable
for real time studies of gene expression in living cells.

Detectable fluorescence may be obtained faster due to shorter
maturation time of the chromophore, higher emission intensity, or
a more stable protein, or a combination thereof.

Simultaneous expression of the novel fluorescent proteins under
control of two or more separate promoters.

Expression of more than one gene can be monitored simultaneously
without any damage to living cells.

Simultaneous expression of the novel proteins using one reporter
as internal reference and the other as variable marker, since
regulated expression of a gene can be monitored quantitatively by
fusion of a promoter to e.g. F64L-GFP (or F64L-Y66H-GFP),
measuring the fluorescence, and normalizing it to the fluorescence
of constitutively expressed F64L-Y66H-GFP (or F64L-GFP). The
constitutively expressed F64L-Y66H-GFP (or F64L-GFP) works as
internal reference.
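The normalization to an internal reference described above amounts to a simple ratio. The following Python sketch illustrates it under stated assumptions: the function name, the optional background subtraction and the example readings are hypothetical and are not taken from the specification.

```python
def normalized_expression(
    reporter: float,
    reference: float,
    reporter_background: float = 0.0,
    reference_background: float = 0.0,
) -> float:
    """Return regulated-reporter fluorescence relative to the
    constitutively expressed internal reference.

    Background subtraction is an assumption added for illustration;
    with the default backgrounds of 0 it reduces to reporter / reference.
    """
    corrected_reference = reference - reference_background
    if corrected_reference <= 0:
        raise ValueError("reference signal must exceed its background")
    return (reporter - reporter_background) / corrected_reference


# Hypothetical readings in arbitrary fluorescence units:
# green channel (e.g. promoter fused to F64L-GFP) and
# blue channel (e.g. constitutively expressed F64L-Y66H-GFP).
ratio = normalized_expression(reporter=320.0, reference=170.0,
                              reporter_background=20.0,
                              reference_background=20.0)
print(ratio)
```

Dividing by the reference compensates for differences in cell number or overall expression capacity between samples, which is the role the text assigns to the constitutively expressed protein.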



WO 97/11094    PCT/DK96/00051

Use as a protein tag in living and fixed cells. Due to the strong
fluorescence the novel proteins are suitable tags for proteins
present at low concentrations. Since no substrate is needed and
visualisation of the cells does not damage the cells, dynamic
analysis can be performed.

Use as an organelle tag. More than one organelle can be tagged and
visualised simultaneously in living cells, e.g. the endoplasmic
reticulum and the cytoskeleton.

Use as markers in cell or organelle fusions. By labelling two or
more cells or organelles with the novel proteins, e.g. F64L-Y66H-
GFP and F64L-GFP, respectively, fusions, such as heterokaryon
formation, can be monitored.

Translocation of proteins fused to the novel proteins of the
invention can be visualised. The translocation of intracellular
proteins to a specific organelle can be visualised by fusing the
protein of interest to one fluorescent protein, e.g. F64L-Y66H-
GFP, and labelling the organelle with another fluorescent protein,
e.g. F64L-GFP, which emits light of a different wavelength.
Translocation can then be detected as a spectral shift of the
fluorescent proteins in the specific organelle.

Use as a secretion marker. By fusion of the novel proteins to a
signal peptide or a peptide to be secreted, secretion may be
followed on-line in living cells. A precondition for that is that
the maturation of a detectable number of novel fluorescent protein
molecules occurs faster than the secretion. This appears not to
be the case for the fluorescent proteins GFP or Y66H-GFP of the
prior art.

Use as genetic reporter or protein tag in transgenic animals. Due
to the strong fluorescence of the novel proteins, they are
suitable as tags for proteins and gene expression, since the
signal to noise ratio is significantly improved over the prior art
proteins, such as wild-type GFP.

Use as a cell or organelle integrity marker. By co-expressing two
of the novel proteins, the one targeted to an organelle and the
other expressed in the cytosol, it is possible to calculate the
relative leakage of the cytosolic protein and use that as a
measure of cell integrity.

Use as a marker for changes in cell morphology. Expression of the
novel proteins in cells allows easy detection of changes in cell
morphology, e.g. blebbing, caused by cytotoxic agents or
apoptosis. Such morphological changes are difficult to visualize
in intact cells without the use of fluorescent probes.

Use as a transfection marker, and as a marker to be used in
combination with FACS sorting. Due to the increased brightness of
the novel proteins the quality of cell detection and sorting can
be significantly improved.

Use of the novel proteins as a ratio real-time kinase probe. By
simultaneous expression of, e.g. F64L-GFP (or F64L-Y66H-GFP),
which emits more light upon phosphorylation, and a derivative of
F64L-Y66H-GFP which emits less light upon phosphorylation, the
ratio of the two intensities would reveal kinase activity more
accurately than only one probe.

Use as real-time probe working at near physiological
concentrations. Since the novel proteins are significantly
brighter than wild type GFP and prior art derivatives at about
37°C the concentration needed for visualisation can be lowered.
Target sites for enzymes engineered into the novel proteins, e.g.
F64L-Y66H-GFP or F64L-GFP, can therefore be present in the cell
at low concentrations in living cells. This is important for two
reasons: 1) the probe must interfere as little as possible with
the intracellular process being studied; 2) the translational
and transcriptional apparatus should be stressed minimally.

The novel proteins can be used as real time probes based on energy
transfer, for example a probe system based on energy transfer
from, e.g. F64L-Y66H-GFP to F64L-GFP.

The novel proteins can be used as reporters to monitor live/dead
biomass of organisms, such as fungi. By constitutive expression
of F64L-Y66H-GFP or F64L-GFP in fungi the viable biomass will
light up.

Transposon vector mutagenesis can be performed using the novel
proteins as markers in transcriptional and translational fusions.

Transposons encoding the novel proteins can be used in
microorganisms. The transposons may be constructed for
translational and transcriptional fusions, to be used for
screening for promoters.

Transposon vectors encoding the novel proteins, such as F64L-Y66H-
GFP and F64L-GFP, can be used for tagging plasmids and
chromosomes.

Use of the novel proteins enables the study of transfer of
conjugative plasmids, since more than one parameter can be
followed in living cells. The plasmid may be tagged by F64L-Y66H-
GFP or F64L-GFP and the chromosome of the donor/recipient by F64L-
Y66H-GFP or F64L-GFP.

Use as a reporter for bacterial detection by introducing the novel
proteins into the genome of bacteriophages.
By engineering the novel proteins, e.g. F64L-Y66H-GFP or F64L-GFP,
into the genome of a phage a diagnostic tool can be designed.
F64L-Y66H-GFP or F64L-GFP will be expressed only upon transfection
of the genome into a living host. The host specificity is defined
by the bacteriophage.

Any novel feature or combination of features described herein is 
considered essential to this invention. 

DETAILED DESCRIPTION OF THE INVENTION

In a preferred embodiment of the present invention, the novel
fluorescent protein is the F64L mutant of GFP or the blue variant
Y66H-GFP, said mutant showing increased fluorescence intensity.
A preferred sequence of the gene encoding GFP derived from
Aequorea victoria is disclosed in Fig. 2 herein. Fig. 2 shows the
nucleotide sequence of a wild-type GFP (Hind3-EcoR1 fragment) and
the amino acid sequence, wherein start codon ATG corresponds to
position 8 and stop codon TAA corresponds to position 722 in the
nucleotide sequence. A microorganism, E. coli NN049087, carrying
the DNA sequence shown in Fig. 2 has been deposited for the
purpose of patent procedure according to the Budapest Treaty in
Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH,
Mascheroder Weg 1b, D-38124 Braunschweig, Federal Republic of
Germany, under the deposition No. DSM 10260. Another sequence of
an isotype of this gene is disclosed by Prasher et al., Gene 111,
1992, pp. 229-233 (GenBank Accession No. M62653). Besides, the
novel fluorescent proteins may also be derived from other
fluorescent proteins, e.g. the fluorescent protein of the sea
pansy Renilla reniformis.
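As a quick consistency check on the codon positions quoted above, the length of the coding region can be computed directly. This Python sketch assumes that positions 8 and 722 refer to the first base of the ATG and TAA codons, respectively; the function name is illustrative.

```python
from typing import Tuple


def coding_length(start_codon_pos: int, stop_codon_pos: int) -> Tuple[int, int]:
    """Return (nucleotides, codons) of the coding region.

    Both arguments are 1-based positions of the FIRST base of the start
    and stop codons (an assumption about the numbering). The stop codon
    itself is not counted.
    """
    nucleotides = stop_codon_pos - start_codon_pos
    if nucleotides <= 0 or nucleotides % 3 != 0:
        raise ValueError("start and stop codons are not in the same frame")
    return nucleotides, nucleotides // 3


nt, codons = coding_length(8, 722)
print(nt, codons)  # 714 nucleotides, 238 codons
```

Under that assumption the open reading frame spans 714 nucleotides, i.e. 238 codons, which agrees with the 238-residue length generally reported for Aequorea GFP.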

Herein the abbreviations used for the amino acids are those stated
in J. Biol. Chem. 243 (1968), 3558.

The DNA construct of the invention encoding the novel fluorescent
proteins may be prepared synthetically by established standard
methods, e.g. the phosphoramidite method described by Beaucage and
Caruthers, Tetrahedron Letters 22 (1981), 1859 - 1869, or the
method described by Matthes et al., EMBO Journal 3 (1984), 801 -
805. According to the phosphoramidite method, oligonucleotides are
synthesized, e.g. in an automatic DNA synthesizer, purified,
annealed, ligated and cloned in suitable vectors.

The DNA construct may also be prepared by polymerase chain
reaction (PCR) using specific primers, for instance as described
in US 4,683,202 or Saiki et al., Science 239 (1988), 487 - 491.
A more recent review of PCR methods may be found in PCR Protocols,
1990, Academic Press, San Diego, California, USA.

The DNA construct of the invention may be inserted into a
recombinant vector which may be any vector which may conveniently
be subjected to recombinant DNA procedures. The choice of vector
will often depend on the host cell into which it is to be
introduced. Thus, the vector may be an autonomously replicating
vector, i.e. a vector which exists as an extrachromosomal entity,
the replication of which is independent of chromosomal
replication, e.g. a plasmid. Alternatively, the vector may be one
which, when introduced into a host cell, is integrated into the
host cell genome and replicated together with the chromosome(s)
into which it has been integrated.

The vector is preferably an expression vector in which the DNA
sequence encoding the fluorescent protein of the invention is
operably linked to additional segments required for transcription
of the DNA. In general, the expression vector is derived from
plasmid or viral DNA, or may contain elements of both. The term
"operably linked" indicates that the segments are arranged so that
they function in concert for their intended purposes, e.g.
transcription initiates in a promoter and proceeds through the DNA
sequence coding for the fluorescent protein of the invention.

The promoter may be any DNA sequence which shows transcriptional
activity in the host cell of choice and may be derived from genes
encoding proteins either homologous or heterologous to the host
cell, including native Aequorea GFP genes.

Examples of suitable promoters for directing the transcription of
the DNA sequence encoding the fluorescent protein of the invention
in mammalian cells are the SV40 promoter (Subramani et al., Mol.
Cell Biol. 1 (1981), 854 - 864), the MT-1 (metallothionein gene)
promoter (Palmiter et al., Science 222 (1983), 809 - 814) or the
adenovirus 2 major late promoter.

An example of a suitable promoter for use in insect cells is the
polyhedrin promoter (US 4,745,051; Vasuvedan et al., FEBS Lett.
311 (1992) 7 - 11), the P10 promoter (J.M. Vlak et al., J. Gen.
Virology 69, 1988, pp. 765-776), the Autographa californica
polyhedrosis virus basic protein promoter (EP 397 485), the
baculovirus immediate early gene 1 promoter (US 5,155,037; US
5,162,222), or the baculovirus 39K delayed-early gene promoter (US
5,155,037; US 5,162,222).

Examples of suitable promoters for use in yeast host cells include
promoters from yeast glycolytic genes (Hitzeman et al., J. Biol.
Chem. 255 (1980), 12073 - 12080; Alber and Kawasaki, J. Mol. Appl.
Gen. 1 (1982), 419 - 434) or alcohol dehydrogenase genes (Young
et al., in Genetic Engineering of Microorganisms for Chemicals
(Hollaender et al., eds.), Plenum Press, New York, 1982), or the
TPI1 (US 4,599,311) or ADH2-4c (Russell et al., Nature 304 (1983),
652 - 654) promoters.

Examples of suitable promoters for use in filamentous fungus host
cells are, for instance, the ADH3 promoter (McKnight et al., The
EMBO J. 4 (1985), 2093 - 2099) or the tpiA promoter. Examples of
other useful promoters are those derived from the gene encoding
A. oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, A.
niger neutral α-amylase, A. niger acid stable α-amylase, A. niger
or A. awamori glucoamylase (gluA), Rhizomucor miehei lipase, A.
oryzae alkaline protease, A. oryzae triose phosphate isomerase or
A. nidulans acetamidase. Preferred are the TAKA-amylase and gluA
promoters.

Examples of suitable promoters for use in bacterial host cells
include the promoter of the Bacillus stearothermophilus maltogenic
amylase gene, the Bacillus licheniformis alpha-amylase gene, the
Bacillus amyloliquefaciens BAN amylase gene, the Bacillus subtilis
alkaline protease gene, or the Bacillus pumilus xylosidase gene,
or the phage Lambda PR or PL promoters or the E. coli lac, trp
or tac promoters.

The DNA sequence encoding the novel fluorescent proteins of the
invention may also, if necessary, be operably connected to a
suitable terminator, such as the human growth hormone terminator
(Palmiter et al., op. cit.) or (for fungal hosts) the TPI1 (Alber
and Kawasaki, op. cit.) or ADH3 (McKnight et al., op. cit.)
terminators. The vector may further comprise elements such as
polyadenylation signals (e.g. from SV40 or the adenovirus 5 Elb
region), transcriptional enhancer sequences (e.g. the SV40
enhancer) and translational enhancer sequences (e.g. the ones
encoding adenovirus VA RNAs).

The recombinant vector may further comprise a DNA sequence
enabling the vector to replicate in the host cell in question. An
example of such a sequence (when the host cell is a mammalian
cell) is the SV40 origin of replication.

When the host cell is a yeast cell, suitable sequences enabling
the vector to replicate are the yeast plasmid 2μ replication genes
REP 1-3 and origin of replication.

The vector may also comprise a selectable marker, e.g. a gene the
product of which complements a defect in the host cell, such as
the gene coding for dihydrofolate reductase (DHFR) or the
Schizosaccharomyces pombe TPI gene (described by P.R. Russell,
Gene 40, 1985, pp. 125-130), or one which confers resistance to
a drug, e.g. ampicillin, kanamycin, tetracyclin, chloramphenicol,
neomycin or hygromycin. For filamentous fungi, selectable markers
include amdS, pyrG, argB, niaD, sC.

The procedures used to ligate the DNA sequences coding for the
fluorescent protein of the invention, the promoter and optionally
the terminator and/or secretory signal sequence, respectively, and
to insert them into suitable vectors containing the information
necessary for replication, are well known to persons skilled in
the art (cf., for instance, Sambrook et al., op. cit.).

The host cell into which the DNA construct or the recombinant
vector of the invention is introduced may be any cell which is
capable of expressing the present DNA construct and includes
bacteria, yeast, fungi and higher eukaryotic cells.

Examples of bacterial host cells which, on cultivation, are
capable of expressing the DNA construct of the invention are
gram-positive bacteria, e.g. strains of Bacillus, such as B.
subtilis, B. licheniformis, B. lentus, B. brevis, B.
stearothermophilus, B. alkalophilus, B. amyloliquefaciens, B.
coagulans, B. circulans, B. lautus, B. megatherium or B.
thuringiensis, or strains of Streptomyces, such as S. lividans or
S. murinus, or gram-negative bacteria such as Escherichia coli. The
transformation of the bacteria may be effected by protoplast
transformation or by using competent cells in a manner known per
se (cf. Sambrook et al., supra).

Examples of suitable mammalian cell lines are the HEK293 and the
HeLa cell lines, primary cells, and the COS (e.g. ATCC CRL 1650),
BHK (e.g. ATCC CRL 1632, ATCC CCL 10), CHL (e.g. ATCC CCL39) or
CHO (e.g. ATCC CCL 61) cell lines. Methods of transfecting
mammalian cells and expressing DNA sequences introduced in the
cells are described in e.g. Kaufman and Sharp, J. Mol. Biol. 159
(1982), 601 - 621; Southern and Berg, J. Mol. Appl. Genet. 1
(1982), 327 - 341; Loyter et al., Proc. Natl. Acad. Sci. USA 79
(1982), 422 - 426; Wigler et al., Cell 14 (1978), 725; Corsaro and
Pearson, Somatic Cell Genetics 7 (1981), 603; Graham and van der
Eb, Virology 52 (1973), 456; and Neumann et al., EMBO J. 1 (1982),
841 - 845.

Examples of suitable yeast cells include cells of Saccharomyces
spp. or Schizosaccharomyces spp., in particular strains of
Saccharomyces cerevisiae or Saccharomyces kluyveri. Methods for
transforming yeast cells with heterologous DNA and producing
heterologous polypeptides therefrom are described, e.g. in US
4,599,311, US 4,931,373, US 4,870,008, 5,037,743, and US
4,845,075, all of which are hereby incorporated by reference.
Transformed cells are selected by a phenotype determined by a
selectable marker, commonly drug resistance or the ability to grow
in the absence of a particular nutrient, e.g. leucine. A preferred
vector for use in yeast is the POT1 vector disclosed in US
4,931,373. The DNA sequence encoding the fluorescent protein of
the invention may be preceded by a signal sequence and optionally
a leader sequence, e.g. as described above. Further examples of
suitable yeast cells are strains of Kluyveromyces, such as K.
lactis, Hansenula, e.g. H. polymorpha, or Pichia, e.g. P. pastoris
(cf. Gleeson et al., J. Gen. Microbiol. 132, 1986, pp. 3459-3465;
US 4,882,279).

Examples of other fungal cells are cells of filamentous fungi,
e.g. Aspergillus spp., Neurospora spp., Fusarium spp. or
Trichoderma spp., in particular strains of A. oryzae, A. nidulans
or A. niger. The use of Aspergillus spp. for the expression of
proteins is described in, e.g., EP 272 277, EP 230 023, EP 184
438.

When a filamentous fungus is used as the host cell, it may be
transformed with the DNA construct of the invention, conveniently
by integrating the DNA construct in the host chromosome to obtain
a recombinant host cell. This integration is generally considered
to be an advantage as the DNA sequence is more likely to be stably
maintained in the cell. Integration of the DNA constructs into
the host chromosome may be performed according to conventional
methods, e.g. by homologous or heterologous recombination.

Transformation of insect cells and production of heterologous
polypeptides therein may be performed as described in US
4,745,051; US 4,879,236; US 5,155,037; 5,162,222; EP 397,485, all
of which are incorporated herein by reference. The insect cell
line used as the host may suitably be a Lepidoptera cell line,
such as Spodoptera frugiperda cells or Trichoplusia ni cells (cf.
US 5,077,214). Culture conditions may suitably be as described in,
for instance, WO 89/01029 or WO 89/01028, or any of the
aforementioned references.

The transformed or transfected host cell described above is then
cultured in a suitable nutrient medium under conditions permitting
the expression of the present DNA construct, after which the cells
may be used in the screening method of the invention.
Alternatively, the cells may be disrupted, after which cell
extracts and/or supernatants may be analysed for fluorescence.

The medium used to culture the cells may be any conventional
medium suitable for growing the host cells, such as minimal or
complex media containing appropriate supplements. Suitable media
are available from commercial suppliers or may be prepared
according to published recipes (e.g. in catalogues of the American
Type Culture Collection).

In the method of the invention, the fluorescence of cells
transformed or transfected with the DNA construct of the invention
may suitably be measured in a spectrometer or a fluorescence
microscope where the spectral properties of the cells in liquid
culture may be determined as scans of light excitation and
emission.

The invention is further illustrated in the following examples 
with reference to the appended drawings. 



Example 1.

Cloning of cDNA encoding GFP

Briefly, total RNA, isolated from A. victoria by a standard
procedure (Sambrook et al., Molecular Cloning, 2nd ed. (1989),
Cold Spring Harbor Laboratory Press: Cold Spring Harbor, New
York, 7.19-7.22) was converted into cDNA by using the AMV reverse
transcriptase (Promega, Madison, WI, USA) as recommended by the
manufacturer. The cDNA was then PCR amplified, using PCR primers
designed on the basis of a previously published GFP sequence
(Prasher et al., Gene 111 (1992), 229-233; GenBank accession No.
M62653) together with the UlTma™ polymerase (Perkin Elmer, Foster
City, CA, USA). The sequences of the primers were:

GFP2: TGGAATAAGCTTTATGAGTAAAGGAGAAGAACTTTT

GFP-1: AAGAATTCGGATCCCTTTAGTGTCAATTGGAAGTCT

Restriction endonuclease sites inserted in the 5' (a HindIII site)
and 3' (EcoRI and BamHI sites) primers facilitated the cloning of
the PCR amplified GFP cDNA into a slightly modified pUC19 vector.
The details of the construction are as follows: LacZ Shine-
Dalgarno AGGA, immediately followed by the 5' HindIII site plus
an extra T and the GFP ATG codon, giving the following DNA
sequence at the lacZ-promoter GFP fusion point: P-
AGGAAAGCTTTATG-GFP. At the 3' end of the GFP cDNA, the base pair
corresponding to nucleotide 770 in the published GFP sequence
(GenBank accession No. M62653) was fused to the EcoRI site of the
pUC19 multiple cloning site (MCS) through a PCR generated BamHI,
EcoRI linker region.
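The restriction sites built into the two cloning primers can be located with a few lines of Python. This is a minimal sketch: the function name `find_sites` is illustrative, the primer strings are the GFP2 and GFP-1 sequences given above with the extraction spaces removed, and the recognition sequences are the standard ones for HindIII, EcoRI and BamHI.

```python
# Standard recognition sequences of the three enzymes named in the text.
SITES = {"HindIII": "AAGCTT", "EcoRI": "GAATTC", "BamHI": "GGATCC"}

# Primer sequences as listed in Example 1 (5' to 3').
GFP2 = "TGGAATAAGCTTTATGAGTAAAGGAGAAGAACTTTT"
GFP_1 = "AAGAATTCGGATCCCTTTAGTGTCAATTGGAAGTCT"


def find_sites(primer: str, sites: dict = SITES) -> dict:
    """Map each enzyme to the 1-based start positions of its site.

    Enzymes whose site does not occur in the primer are omitted.
    """
    found = {}
    for name, site in sites.items():
        positions = []
        start = primer.find(site)
        while start != -1:
            positions.append(start + 1)  # report 1-based positions
            start = primer.find(site, start + 1)
        if positions:
            found[name] = positions
    return found


print(find_sites(GFP2))   # HindIII site in the 5' primer
print(find_sites(GFP_1))  # EcoRI and BamHI sites in the 3' primer
```

In GFP2 the HindIII site is followed by one extra T and then ATG, matching the AAGCTTTATG junction described above; in GFP-1 the EcoRI site starts at position 3 and is immediately followed by the BamHI site.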

The DNA sequence and predicted primary amino acid sequence of GFP
is shown below in Fig. 2a. Another DNA sequence encoding the same
amino acid sequence as shown in Fig. 2a is shown in Fig. 2b. To
generate the blue fluorescent variant described by Heim et al.
(1994), a PCR primer incorporating the Y66H substitution
responsible for changing green fluorescence into blue fluorescence
was used as 5' PCR primer in combination with a GFP specific 3'
primer. The template was the GFP clone described above. The
sequence of the 5' primer is
5'-CTACCTGTTCCATGGCCAACGCTTGTCACTACTTTCTCTCATGGTGTTCAATGCTTTTCTAGATACCC-3'.
Its 5' end corresponds to position 164 in the GFP sequence. In
addition to the Y66H substitution, the 5' primer introduces an A
to T change at position 223; this mutation creates an XbaI site
without changing an amino acid. The 5' primer also contains the
naturally occurring NcoI recognition sequence (position 173 in the
GFP sequence). The sequence of the 3' primer is
5'-AAGAATTCGGATCCCTTTAGTGTCAATTGGAAGTCT-3'. Position 3 from the 5'
end is the first base of the EcoRI recognition site that
corresponds to the 3' end of the GFP sequence. The resulting PCR
product was digested with NcoI and EcoRI and cloned into an
NcoI-EcoRI vector fragment to reconstitute the entire Y66H-GFP
gene.
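
The position numbers quoted in this paragraph can be checked against
the primers themselves. The following Python sketch is an
illustration only. It assumes the 5' primer reads as reconstructed
from the GFP codons of Fig. 2a, counts positions 1-based in the
Hind3-EcoRI fragment (so the ATG start codon sits at positions 8-10
and codon 66 at positions 203-205), and uses a hypothetical helper
name, site_position, that is not part of the original disclosure.

```python
# Illustrative check of the restriction-site positions quoted in the text.
# Assumptions: primer strings are the readings used in this document
# (5' primer reconstructed from Fig. 2a); helper names are hypothetical.

Y66H_5P_PRIMER = ("CTACCTGTTCCATGGCCAACGCTTGTCACTACTTTCTCTCATGGTGTTCAATGC"
                  "TTTTCTAGATACCC")
GFP_3P_PRIMER = "AAGAATTCGGATCCCTTTAGTGTCAATTGGAAGTCT"

SITES = {"NcoI": "CCATGG", "XbaI": "TCTAGA",
         "EcoRI": "GAATTC", "BamHI": "GGATCC"}

PRIMER_START = 164  # position of the primer's first base, as stated in the text


def site_position(seq, site, first_base=1):
    """Return the 1-based position of the first base of `site` in `seq`,
    numbered so that seq[0] sits at `first_base`; None if the site is absent."""
    idx = seq.find(site)
    return None if idx < 0 else idx + first_base


ncoi = site_position(Y66H_5P_PRIMER, SITES["NcoI"], PRIMER_START)
xbai = site_position(Y66H_5P_PRIMER, SITES["XbaI"], PRIMER_START)
# Codon 66 spans positions 203-205 (ATG at 8-10, three bases per codon).
codon66 = Y66H_5P_PRIMER[203 - PRIMER_START:206 - PRIMER_START]
ecori = site_position(GFP_3P_PRIMER, SITES["EcoRI"])
bamhi = site_position(GFP_3P_PRIMER, SITES["BamHI"])

print("NcoI at", ncoi, "XbaI at", xbai, "codon 66 =", codon66)
print("3' primer: EcoRI at", ecori, "BamHI at", bamhi)
```

Under these assumptions the NcoI and XbaI sites fall at positions 173
and 221, the third base of the XbaI site is the A to T change at
position 223, and codon 66 reads CAT (histidine).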

E. coli cells carrying an expression vector containing Y66H-GFP
were grown overnight in the presence of 10 micrograms per ml
N-methyl-N'-nitro-N-nitrosoguanidine. Plasmid DNA was isolated, the
764 bp Hind3-EcoRI insert containing Y66H-GFP was isolated and
cloned into a Hind3-EcoRI digested vector fragment, allowing
expression of the insert in E. coli. E. coli transformants were
inspected for blue fluorescence when excited with a 365 nm UV
light, and colonies that appeared to fluoresce stronger than
wildtype BFP were identified.

10 ng DNA from one particular colony was used as template in a PCR
reaction containing 1.5 units of Taq polymerase (Perkin Elmer),
0.1 mM MnCl2, 0.2 mM each of dGTP, dCTP and dTTP, 0.05 mM dATP,
1.7 mM MgCl2 and the buffer recommended by the manufacturer. The
primers used flank the Y66H-GFP insert. The sequence of the 5'
primer was 5'-AATTGGTACCAAGGAGGTAAGCTTTATGAG-3'; it contains a
Hind3 recognition sequence. The sequence of the 3' primer was
5'-CTTTCGTTTTGAATTCGGATCCCTTTAGTG-3'; it contains an EcoRI
recognition sequence.

The PCR product was digested with Hind3 and EcoRI and cloned into
a Hind3-EcoRI digested vector fragment, allowing expression of the
insert in E. coli. E. coli transformants were inspected for blue
fluorescence when excited with a 365 nm UV light, and colonies
that appeared to fluoresce stronger than Y66H-GFP were identified.
Plasmid DNA from one strongly fluorescing colony (called BX12-1A)
was isolated and the Y66H-GFP insert was subjected to sequence
determination. The mutation F64L was identified. This mutation
replaces the phenylalanine residue preceding the SHG tripeptide
chromophore sequence of Y66H-GFP with leucine. No other amino acid
changes were present in the Y66H-GFP sequence of BX12-1A. The DNA
sequence and predicted primary amino acid sequence of
F64L-Y66H-GFP is shown in Fig. 3 below.

Example 2.

F64L-GFP was constructed as follows: An E. coli expression vector
containing Y66H-GFP was digested with restriction enzymes NcoI and
XbaI. The recognition sequence of NcoI is located at position 173
and the recognition sequence of XbaI is located at position 221
in the F64L-Y66H-GFP sequence listed below. The large NcoI-XbaI
vector fragment was isolated and ligated with a synthetic
NcoI-XbaI DNA linker of the following sequence:

One DNA strand has the sequence:

5'-CATGGCCAACGCTTGTCACTACTCTCTCTTATGGTGTTCAATGCTTTT-3'

The other DNA strand has the sequence:

5'-CTAGAAAAGCATTGAACACCATAAGAGAGAGTAGTGACAAGCGTTGGC-3'

Upon annealing, the two strands form an NcoI-XbaI fragment that
incorporates the sequence of the GFP chromophore SYG with the F64L
substitution preceding SYG. The DNA sequence and predicted primary
amino acid sequence of F64L-GFP is shown in Fig. 4 below.
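
That the two synthetic strands anneal as described can be confirmed
with a few lines of Python. This is a minimal sketch under stated
assumptions: the strand sequences are those given above, the 4-base
overhang length is the one expected for NcoI and XbaI cohesive ends,
and the function names are hypothetical.

```python
# Sketch: check that the two linker strands pair perfectly apart from a
# 4-base 5' overhang on each strand. Function names are hypothetical.

COMPLEMENT = str.maketrans("ACGT", "TGCA")

TOP = "CATGGCCAACGCTTGTCACTACTCTCTCTTATGGTGTTCAATGCTTTT"
BOTTOM = "CTAGAAAAGCATTGAACACCATAAGAGAGAGTAGTGACAAGCGTTGGC"


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def annealed_duplex(top, bottom, overhang=4):
    """Return (top_overhang, paired_region, bottom_overhang) when the two
    strands are fully complementary outside their 5' overhangs, else None."""
    paired_top = top[overhang:]
    paired_bottom = reverse_complement(bottom[overhang:])
    if paired_top != paired_bottom:
        return None
    return top[:overhang], paired_top, bottom[:overhang]


duplex = annealed_duplex(TOP, BOTTOM)
print(duplex)
```

With the sequences above the paired region is 44 bp long, flanked by
a 5'-CATG overhang (compatible with an NcoI-cut end) and a 5'-CTAG
overhang (compatible with an XbaI-cut end).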

The S65T-GFP mutation was described by Heim et al. (Nature vol. 373
pp. 663-664, 1995). F64L-S65T-GFP was constructed as follows: An
E. coli expression vector containing Y66H-GFP was digested with
restriction enzymes NcoI and XbaI. The recognition sequence of
NcoI is located at position 173 and the recognition sequence of
XbaI is located at position 221 in the F64L-Y66H-GFP sequence
listed below. The large NcoI-XbaI vector fragment was isolated and
ligated with a synthetic NcoI-XbaI DNA linker of the following
sequence:

One DNA strand has the sequence:

5'-CATGGCCAACGCTTGTCACTACTCTCACTTATGGTGTTCAATGCTTTT-3'

The other DNA strand has the sequence:

5'-CTAGAAAAGCATTGAACACCATAAGTGAGAGTAGTGACAAGCGTTGGC-3'

Upon annealing, the two strands form an NcoI-XbaI fragment that
incorporates the F64L and S65T mutations in the GFP chromophore.
The DNA sequence and predicted primary amino acid sequence of
F64L-S65T-GFP is shown in Fig. 5 below.

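The residues encoded by the two synthetic linkers can be read off by
translating their upper strands. The sketch below is illustrative
only. It assumes the standard genetic code and assumes that the
reading frame starts at the third base of each strand, which follows
from the NcoI site at position 173 lying two bases ahead of the TGG
codon for residue 57. The names translate and CODON_TABLE are
hypothetical.

```python
# Sketch: translate the upper strand of each synthetic linker in the GFP
# reading frame. Standard genetic code; names are hypothetical.
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

F64L_TOP = "CATGGCCAACGCTTGTCACTACTCTCTCTTATGGTGTTCAATGCTTTT"
F64L_S65T_TOP = "CATGGCCAACGCTTGTCACTACTCTCACTTATGGTGTTCAATGCTTTT"

FRAME_OFFSET = 2   # strand starts two bases before the codon for residue 57
FIRST_RESIDUE = 57


def translate(seq, offset=0):
    """Translate complete codons of `seq`, starting at index `offset`."""
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


f64l_peptide = translate(F64L_TOP, FRAME_OFFSET)
f64l_s65t_peptide = translate(F64L_S65T_TOP, FRAME_OFFSET)

# Residues 64-67: the mutated position plus the chromophore tripeptide.
lo = 64 - FIRST_RESIDUE
print(f64l_peptide[lo:lo + 4], f64l_s65t_peptide[lo:lo + 4])
```

Under these assumptions residues 64 to 67 read LSYG for the F64L
linker and LTYG for the F64L-S65T linker, in agreement with the
chromophore sequences SYG and TYG named in the text.
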
The E. coli expression vector contains an IPTG
(isopropyl-thio-galactoside)-inducible promoter. The E. coli
strain used is a del(lacZ)M15 derivative of K 803 (Sambrook et al.
supra).

The GFP allele present in the pGFP-N1 plasmid (available from
Clontech Laboratories) was introduced into the IPTG inducible
E. coli expression vector in the following manner:
1 ng pGFP-N1 plasmid DNA was used as template in a standard PCR
reaction where the 5' PCR primer had the sequence:
5'-TGGAATAAGCTTTATGAGTAAAGGAGAAGAACTTTT-3'
and the 3' PCR primer had the sequence:
5'-GAATCGTAGATCTTTATTTGTATAGTTCATCCATG-3'.

The primers flank the GFP-N1 insert in the vector pGFP-N1. The 5'
primer includes the ATG start codon preceded by a Hind3 cloning
site. The 3' primer includes a TAA stop codon followed by a Bgl2
cloning site.
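
The features attributed to the two primers can be located
programmatically. The following sketch is an illustration, not part
of the original protocol: the primer strings are written without the
spacing of the source, the recognition sequences AAGCTT (Hind3) and
AGATCT (Bgl2) are the standard ones, and the variable names are
hypothetical. Because the 3' primer anneals to the coding strand, its
reverse complement is inspected.

```python
# Sketch: locate the cloning sites, start codon and stop codon in the
# pGFP-N1 PCR primers. Variable and function names are hypothetical.

COMPLEMENT = str.maketrans("ACGT", "TGCA")

FWD_PRIMER = "TGGAATAAGCTTTATGAGTAAAGGAGAAGAACTTTT"
REV_PRIMER = "GAATCGTAGATCTTTATTTGTATAGTTCATCCATG"

HIND3_SITE = "AAGCTT"
BGL2_SITE = "AGATCT"


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# 5' primer: Hind3 site, then the first ATG downstream of it.
hind3_idx = FWD_PRIMER.find(HIND3_SITE)
atg_idx = FWD_PRIMER.find("ATG", hind3_idx + len(HIND3_SITE))
spacer = FWD_PRIMER[hind3_idx + len(HIND3_SITE):atg_idx]

# 3' primer: read on the coding strand, a TAA stop directly followed by Bgl2.
coding_3p_end = reverse_complement(REV_PRIMER)
stop_idx = coding_3p_end.find("TAA" + BGL2_SITE)

print("spacer between Hind3 and ATG:", repr(spacer))
print("coding-strand 3' end:", coding_3p_end, "stop at index", stop_idx)
```

The single T found between the Hind3 site and the ATG matches the
"extra T" of the lacZ-promoter GFP fusion point described for the
first GFP clone (AGGAAAGCTTTATG).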

The PCR product was digested with Hind3 and Bgl2 and cloned into
a Hind3-BamHI digested vector fragment behind an IPTG inducible
promoter, allowing expression of the insert in E. coli in the
presence of IPTG.

The lacZ gene present in the pZeoSV-LacZ plasmid (available from
Invitrogen) was introduced into the IPTG inducible E. coli
expression vector in the following manner:

1 ng pZeoSV-LacZ plasmid DNA was used as template in a standard
PCR reaction where the 5' PCR primer had the sequence:
5'-TGGAATAAGCTTTATGGATCCCGTCGTTTTACAACGTCGT-3'
and the 3' PCR primer had the sequence:
5'-GCGCGAATTCTTATTATTATTTTTGACACCAGAC-3'.

The primers flank the lacZ insert in the vector pZeoSV-LacZ. The
5' primer includes the ATG start codon preceded by a Hind3 cloning
site. The 3' primer includes a TAA stop codon followed by an EcoRI
cloning site.

The PCR product was digested with Hind3 and EcoRI and cloned into
a Hind3-EcoRI digested vector fragment behind an IPTG inducible
promoter, allowing expression of the insert in E. coli in the
presence of IPTG.

To measure and compare the fluorescence generated in E. coli cells 
expressing GFP, GFP-N1, F64L-GFP, F64L-S65T-GFP, Y66H-GFP, F64L- 
Y66H-GFP or beta-galactosidase (as background control) under 
various conditions the following experiments were done: 

E. coli cells containing an expression plasmid allowing expression
of one of the various gene products upon induction with IPTG were
grown in LB medium containing 100 micrograms per milliliter
ampicillin and no IPTG. To 1 ml cell suspension was added 0.5 ml
50% glycerol and cells were frozen and kept frozen at -80C.

Cells from the -80C glycerol stocks were inoculated into 2 ml LB
medium containing 100 µg/ml ampicillin and grown with aeration at
37C for 6 hours. 2 microliters of this inoculum was transferred
to each of two tubes containing 2 ml of LB medium with 100 µg/ml
ampicillin and 1 mM IPTG. The two sets of tubes were incubated
with aeration at two different temperatures: room temperature
(22C) and 37C.

After 16 hours 0.2 ml samples were taken of cells expressing GFP,
GFP-N1, F64L-GFP, F64L-S65T-GFP, Y66H-GFP, F64L-Y66H-GFP or
beta-galactosidase. Cells were pelleted, the supernatant was
removed, cells were resuspended in 2 ml water and transferred to a
cuvette. Fluorescence emission spectra were measured in a LS-50
luminometer (Perkin-Elmer) with excitation and emission slits set
to 10 nm. The excitation wavelengths were set to 398 nm and 470 nm
for GFP, GFP-N1, F64L-GFP and F64L-S65T-GFP; 398 nm is near the
optimal excitation wavelength for GFP, GFP-N1 and F64L-GFP, and
470 nm is near the optimal excitation wavelength for
F64L-S65T-GFP. For Y66H-GFP and F64L-Y66H-GFP the excitation
wavelength was set to 380 nm, which is near the optimal excitation
wavelength for these derivatives. Beta-galactosidase expressing
cells were included as background controls. Following the
measurements in the LS-50 luminometer, the optical density at
450 nm was measured for each sample in a spectrophotometer (Lambda
UV/VIS, Perkin-Elmer). This is a measure of total cells in the
assay. Luminometer data were normalized to the optical density of
the sample.
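
The normalization step amounts to dividing each emission reading by
the OD450 of the same sample. The sketch below is a hedged
illustration: the function names and all numbers are hypothetical,
and the subtraction of the normalized beta-galactosidase control is
an assumption about how "signal over background" could be computed,
not a procedure stated in the text.

```python
# Sketch: normalise luminometer readings to cell density (OD450) and,
# as an assumed extra step, subtract the normalised background control.
# All names and numbers are hypothetical examples.


def normalise(emission, od450):
    """Return the emission intensity per OD450 unit."""
    if od450 <= 0:
        raise ValueError("OD450 must be positive")
    return emission / od450


def signal_over_background(sample, background):
    """Return normalised sample signal minus normalised background.

    Each argument is an (emission, od450) pair measured on one cuvette.
    """
    return normalise(*sample) - normalise(*background)


# Hypothetical readings: (emission in arbitrary units, OD450).
sample_reading = (250.0, 0.5)
control_reading = (10.0, 0.25)   # beta-galactosidase expressing cells

net_signal = signal_over_background(sample_reading, control_reading)
print(net_signal)
```

Normalizing before comparing samples keeps a denser culture from
appearing brighter merely because it contains more cells.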

The results of the experiments are shown in Fig. 6a - 6f below and
can be summarized as follows:

After 16 hours at 22C using an excitation wavelength of 398 nm
there were large signals for GFP and F64L-GFP, and detectable
signals for GFP-N1 and F64L-S65T-GFP, cf. Fig. 6a.

After 16 hours at 37C with an excitation wavelength of 398 nm
there was a large signal for F64L-GFP, a detectable signal for
F64L-S65T-GFP, and no detectable signals for GFP and GFP-N1, cf.
Fig. 6b.

After 16 hours at 22C with an excitation wavelength of 470 nm
there was a large signal for F64L-S65T-GFP, detectable signals
for GFP and F64L-GFP, and no detectable signal for GFP-N1, cf.
Fig. 6c.

After 16 hours at 37C with an excitation wavelength of 470 nm
there were large signals for F64L-S65T-GFP and F64L-GFP, and no
detectable signals for GFP and GFP-N1, cf. Fig. 6d.

After 16 hours at 22C with an excitation wavelength of 380 nm
there were detectable signals over background for Y66H-GFP and
F64L-Y66H-GFP, cf. Fig. 6e.

After 16 hours at 37C with an excitation wavelength of 380 nm
there was no detectable signal over background for Y66H-GFP and
a large signal for F64L-Y66H-GFP, cf. Fig. 6f.

To determine whether the differences in fluorescence signals were
due to differences in expression levels, total protein from the
E. coli cells (0.5 OD450 units) analyzed as described above was
fractionated by SDS-polyacrylamide gel electrophoresis (12%
Tris-glycine gels from BIO-RAD Laboratories) followed by Western
blot analysis (ECL Western blotting from Amersham International)
with polyclonal GFP antibodies (from rabbit). The result showed
that expression levels of GFP, GFP-N1, F64L-GFP, F64L-S65T-GFP,
Y66H-GFP and F64L-Y66H-GFP were identical, both at 22C and 37C.
The differences in fluorescence signals are therefore not due to
different expression levels.

Example 3.

Influence of the F64L substitution on GFP and its
derivatives when expressed in mammalian cells.



F64L-Y66H-GFP, F64L-GFP, and F64L-S65T-GFP were cloned into pcDNA3
(Invitrogen, CA, USA) so that the expression was under control
of the CMV promoter. Wild-type GFP was expressed from the pGFP-N1
plasmid (Clontech, CA, USA) in which the CMV promoter controls the
expression. Plasmid DNA to be used for transfection was purified
using Jetstar Plasmid kit (Genomed Inc. NC, USA) and was dissolved
in distilled water.

The precipitate used for the transfections was made by mixing the
following components: 2 µg DNA in 44 µl of water was mixed with
50 µl 2xHBS buffer (280 mM NaCl, 1.5 mM Na2HPO4, 12 mM dextrose,
50 mM HEPES) and 6.2 µl 2M CaCl2. The transfection mix was
incubated at room temperature for 25 minutes before it was added
to the cells. HEK 293 cells (ATCC CRL 1573) were grown in 2 cm by
2 cm coverglass chambers (Nunc, Denmark) with approximately 1.5
ml medium (Dulbecco's MEM with glutamax-1, 4500 mg/L glucose, and
10% FCS; Gibco BRL, MD, USA). The DNA was added to cells at 25-50%
confluence. Cells were grown at 37°C in a CO2 incubator. Prior to
visualisation the medium was removed and 1.5 ml Ca2+-HEPES buffer
(5 mM KCl, 140 mM NaCl, 5.5 mM glucose, 1 mM MgSO4, 1 mM CaCl2, 10
mM HEPES) was added to the chamber.
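
The final composition of the transfection mix follows from simple
dilution arithmetic. The sketch below is illustrative and rests on
the assumption that all three volumes are in microlitres (44 µl DNA
solution, 50 µl 2xHBS, 6.2 µl 2 M CaCl2); the variable names are
hypothetical.

```python
# Sketch: final concentrations in the calcium phosphate transfection mix.
# Assumption: all volumes are in microlitres; names are hypothetical.

DNA_UG = 2.0
DNA_VOLUME_UL = 44.0
HBS_2X_VOLUME_UL = 50.0
CACL2_VOLUME_UL = 6.2
CACL2_STOCK_MM = 2000.0  # 2 M stock

# Composition of the 2xHBS stock in mM, as listed in the text.
HBS_2X_MM = {"NaCl": 280.0, "Na2HPO4": 1.5, "dextrose": 12.0, "HEPES": 50.0}

total_ul = DNA_VOLUME_UL + HBS_2X_VOLUME_UL + CACL2_VOLUME_UL

# C_final = C_stock * V_stock / V_total for each component.
final_cacl2_mm = CACL2_STOCK_MM * CACL2_VOLUME_UL / total_ul
final_hbs_mm = {name: conc * HBS_2X_VOLUME_UL / total_ul
                for name, conc in HBS_2X_MM.items()}
dna_ng_per_ul = DNA_UG * 1000.0 / total_ul

print(f"total volume: {total_ul:.1f} ul")
print(f"CaCl2: {final_cacl2_mm:.1f} mM, DNA: {dna_ng_per_ul:.1f} ng/ul")
for name, conc in final_hbs_mm.items():
    print(f"{name}: {conc:.2f} mM")
```

On these assumptions the mix is about 100 µl, the HBS components end
up at very nearly 1x, and CaCl2 is close to 124 mM.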

Transfectants were visualised using an Axiovert 135 (Carl Zeiss,
Germany) fluorescence microscope. The microscope was equipped with
an HBO 100 mercury excitation source and a 40x, Fluar, NA=1.3
objective (Carl Zeiss, Germany). To visualise GFP, F64L-GFP, and
F64L-S65T-GFP the following filters were used: excitation 480/40
nm, dichroic 505 nm, and emission 510LP nm (all from Chroma
Technologies Corp., VT, USA). To visualise F64L-Y66H-GFP the
following filters were used: excitation 380/15 nm, dichroic 400
nm, and emission 450/65 nm (all from Omega Optical, VT, USA).
Cells in several chambers were transfected in parallel, so that
a new chamber could be taken for each sample point. In cases where
the incubation extended beyond 8.5 hours the Ca2+ precipitate was
removed by replacing the medium.

As shown in Table 1 the F64L mutation enhances the fluorescent
signal significantly (wild type GFP versus F64L-GFP and
F64L-S65T-GFP). Fluorescent cells can be observed as early as 1-2
hours post-transfection, indicating an efficient maturation of the
chromophore at 37°C. Furthermore, the F64L mutation enhances
other GFP derivatives like the S65T mutant, which has a shifted
excitation spectrum, and the blue derivative, which is not
detectable in mammalian cells without the F64L substitution.

(Comment: When comparing the results of F64L-S65T-GFP and F64L-GFP
one has to take into account that the excitation spectra differ
and that the filter set used is optimised for F64L-S65T-GFP.
F64L-GFP and WT GFP share the same spectral properties.)

Table 1.

Appearance of fluorescent HEK 293 cells upon transfection by Ca2+ precipitation.

1) Indicates the estimated average intensity of the fluorescent cells.

2) Indicates the abundance of fluorescent cells.

3) Cells reached 100% confluence.

Claims

1. A fluorescent protein derived from Green Fluorescent Protein
(GFP) or any functional analogue thereof, wherein the amino acid
in position 1 preceding the chromophore has been mutated to
provide an increase of fluorescence intensity.

2. A fluorescent protein according to the preceding claim, wherein
the chromophore is in position 65-67 of the predicted primary
amino acid sequence of GFP.

3. A fluorescent protein according to any one of the preceding
claims resulting in an increased fluorescence in cells expressing
said fluorescent protein when said cells are incubated at a
temperature of 30°C or above 30°C, preferably at a temperature of
from 32°C to 39°C, more preferably at a temperature of from 35°C
to 38°C, and most preferably at a temperature of about 37°C.

4. A fluorescent protein according to any one of the preceding
claims, said protein being derived from Aequorea victoria or
Renilla reniformis.

5. A fluorescent protein according to any one of the preceding
claims, wherein the amino acid F in position 64 of GFP or Y66H-GFP
has been substituted by an amino acid selected from the group
consisting of L, I, V, A and G.

6. A fluorescent protein according to any one of the preceding
claims, wherein the amino acid F in position 1 preceding the
chromophore has been substituted by L and the amino acids of the
chromophore include SYG, SHG or TYG.

7. A fluorescent protein according to any one of the preceding
claims and having the amino acid sequence of Fig. 3, Fig. 4 or
Fig. 5 herein.

8. A fusion compound consisting of a fluorescent protein (GFP)
according to any one of claims 1-7, wherein said GFP is linked to
a polypeptide.

9. A fusion compound according to the preceding claim wherein the
polypeptide is a kinase, preferably the catalytic subunit of
protein kinase A, or protein kinase C, or Erk1, or a cytoskeletal
element.

10. A nucleotide sequence coding for the Fluorescent Protein of
claim 1, 2, 3, 4, 5, 6 or 7 or for a fusion compound according to
claim 7 or 8.

11. A nucleotide sequence according to the preceding claim
selected from the sequences shown in Fig. 3, Fig. 4 and Fig. 5.

12. A DNA construct comprising a suitable control region or
regions and a nucleotide sequence according to claim 10 or 11, the
sequence being under the control of the control region.

13. A DNA construct according to the preceding claim being under
the control of the native GFP promoter, or a mammal constitutive
or regulatory promoter, a viral promoter, a yeast promoter, a
filamentous fungi promoter, or a bacterial promoter.

14. A host transformed with a DNA construct according to claim 12
or 13.

15. A host according to the preceding claim selected from the
following: organisms and cells belonging to bacteria, yeast,
fungi, protozoans and higher eukaryotes.

16. A process for preparing a polypeptide, comprising cultivating
a host according to claim 14 or 15 and obtaining therefrom the
polypeptide expressed by said nucleotide sequence.

17. A process according to the preceding claim wherein the
expression of the nucleotide sequence is effected by the native
GFP promoter.

18. Use of the Fluorescent Protein of claim 1, 2, 3, 4, 5, 6 or
7 in an in vitro assay for measuring protein kinase activity, or
dephosphorylation activity, wherein said fluorescent protein in
purified form is added to a biological sample, preferably a cell
extract, and any change in fluorescence is recorded.

19. Use of the host of claim 14 or 15 in an in vivo assay for
measuring metabolic activity, preferably kinase activity and
dephosphorylating activity.

20. Use of the fluorescent protein of claim 1, 2, 3, 4, 5, 6 or
7 as a reporter for gene expression in living cells.

21. Use of the fluorescent protein of claim 1, 2, 3, 4, 5, 6 or
7 for the simultaneous monitoring of more than one gene in living,
intact cells.

22. Use of two or more of the fluorescent proteins of claim 1, 2,
3, 4, 5, 6 or 7 as organelle or cell tags for simultaneous
visualisation of organelle or cell processes in living cells.

DNA and predicted primary amino acid sequence of GFP (Hind3-EcoRI
fragment).

5' - AAGCTTT

ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT GAA TTA GAT GGC
MET SER LYS GLY GLU GLU LEU PHE THR GLY VAL VAL PRO ILE LEU VAL GLU LEU ASP GLY

GAT GTT AAT GGG CAA AAA TTC TCT GTT AGT GGA GAG GGT GAA GGT GAT GCA ACA TAC GGA
ASP VAL ASN GLY GLN LYS PHE SER VAL SER GLY GLU GLY GLU GLY ASP ALA THR TYR GLY

AAA CTT ACC CTT AAA TTT ATT TGC ACT ACT GGG AAG CTA CCT GTT CCA TGG CCA ACG CTT
LYS LEU THR LEU LYS PHE ILE CYS THR THR GLY LYS LEU PRO VAL PRO TRP PRO THR LEU

GTC ACT ACT TTC TCT TAT GGT GTT CAA TGC TTT TCA AGA TAC CCA GAT CAT ATG AAA CAG
VAL THR THR PHE SER TYR GLY VAL GLN CYS PHE SER ARG TYR PRO ASP HIS MET LYS GLN

CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG GAA AGA ACT ATA TTT TAC
HIS ASP PHE PHE LYS SER ALA MET PRO GLU GLY TYR VAL GLN GLU ARG THR ILE PHE TYR

AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC AAG TTT GAA GGT GAT ACC CTT GTT
LYS ASP ASP GLY ASN TYR LYS THR ARG ALA GLU VAL LYS PHE GLU GLY ASP THR LEU VAL

AAT AGA ATC GAG TTA AAA GGT ATT GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA
ASN ARG ILE GLU LEU LYS GLY ILE ASP PHE LYS GLU ASP GLY ASN ILE LEU GLY HIS LYS

ATG GAA TAC AAT TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA CCA AAG AAT GGA
MET GLU TYR ASN TYR ASN SER HIS ASN VAL TYR ILE MET ALA ASP LYS PRO LYS ASN GLY

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT AAA GAT GGA AGC GTT CAA TTA GCA GAC
ILE LYS VAL ASN PHE LYS ILE ARG HIS ASN ILE LYS ASP GLY SER VAL GLN LEU ALA ASP

CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT GTC CTT TTA CCA GAC AAC CAT TAC
HIS TYR GLN GLN ASN THR PRO ILE GLY ASP GLY PRO VAL LEU LEU PRO ASP ASN HIS TYR

CTG TCC ACG CAA TCT GCC CTT TCC AAA GAT CCC AAC GAA AAG AGA GAT CAC ATG ATC CTT
LEU SER THR GLN SER ALA LEU SER LYS ASP PRO ASN GLU LYS ARG ASP HIS MET ILE LEU

CTT GAG TTT GTA ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA TAA
LEU GLU PHE VAL THR ALA ALA GLY ILE THR HIS GLY MET ASP GLU LEU TYR LYS

ATGTCCAGACTTCCAATTGACACTAAAGGGATCCGAATTC - 3'

Fig. 2a

Nucleotide sequence (764 bp) of GFP
(Hind3-EcoRI fragment)

AAGCITIATCAGTA^^ 

. CTcce^rrcmra 

AAAAATTCTCTGlTAGriXj^ 

ATAa3GAAMCTTACCCTTAAAm 

AAGCTACCiqT^ 

GAAACAGCATOACTTT^^ 
• -TATCTACAGGAAAGAACTATATT^^ 
" ACTACMGACACGTCCTX^ 

ATAACT(>CATAATGTATACATCATGGC^ 
GAAltXK^TCAAAGTTAACTTCAAAA 

AAAGATGGAAGCGTTCAATTAGCAGACCATTATCAACA^ 

ATACTCCAATra^CGATGGCCC^ 

CCATTACCTGTCCACGCAATCTGCCCTTTCC^ 

AACGAAAAGAGAGATCACATGATC 

CAGCIX3CTX3GGATrACACATX3^ 

ATAAATCnrCAGACTTCCAATTGACACT - 
ATTC 



Fig. 2b

DNA and predicted primary amino acid sequence of F64L-Y66H-GFP
(Hind3-EcoRI fragment).

5' - AAGCTTT

ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT GAA TTA GAT GGC
MET SER LYS GLY GLU GLU LEU PHE THR GLY VAL VAL PRO ILE LEU VAL GLU LEU ASP GLY

GAT GTT AAT GGG CAA AAA TTC TCC GTT AGT GGA GAG GGT GAA GGT GAT GCA ACA TAC GGA
ASP VAL ASN GLY GLN LYS PHE SER VAL SER GLY GLU GLY GLU GLY ASP ALA THR TYR GLY

AAA CTT ACC CTT AAA TTT ATT TGC ACT ACT GGG AAG CTA CCT GTT CCA TGG CCA ACG CTT
LYS LEU THR LEU LYS PHE ILE CYS THR THR GLY LYS LEU PRO VAL PRO TRP PRO THR LEU

GTC ACT ACT CTC TCT CAT GGT GTT CAA TGC TTT TCT AGA TAC CCA GAT CAT ATG AAA CAG
VAL THR THR LEU SER HIS GLY VAL GLN CYS PHE SER ARG TYR PRO ASP HIS MET LYS GLN

CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG GAA AGA ACT ATA TTT TAC
HIS ASP PHE PHE LYS SER ALA MET PRO GLU GLY TYR VAL GLN GLU ARG THR ILE PHE TYR

AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC AAG TTT GAA GGT GAT ACC CTT GTT
LYS ASP ASP GLY ASN TYR LYS THR ARG ALA GLU VAL LYS PHE GLU GLY ASP THR LEU VAL

AAT AGA ATC GAG TTA AAA GGT ATT GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA
ASN ARG ILE GLU LEU LYS GLY ILE ASP PHE LYS GLU ASP GLY ASN ILE LEU GLY HIS LYS

ATG GAA TAC AAT TAT AAC TCA CAT AAT GTA TAC ATC ATG GCA GAC AAA CCA AAG AAT GGC
MET GLU TYR ASN TYR ASN SER HIS ASN VAL TYR ILE MET ALA ASP LYS PRO LYS ASN GLY

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT AAA GAT GGA AGC GTT CAA TTA GCA GAC
ILE LYS VAL ASN PHE LYS ILE ARG HIS ASN ILE LYS ASP GLY SER VAL GLN LEU ALA ASP

CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT GTC CTT TTA CCA GAC AAC CAT TAC
HIS TYR GLN GLN ASN THR PRO ILE GLY ASP GLY PRO VAL LEU LEU PRO ASP ASN HIS TYR

CTG TCC ACG CAA TCT GCC CTT TCC AAA GAT CCC AAC GAA AAG AGA GAT CAC ATG ATC CTT
LEU SER THR GLN SER ALA LEU SER LYS ASP PRO ASN GLU LYS ARG ASP HIS MET ILE LEU

CTT GAG TTT GTA ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA TAA
LEU GLU PHE VAL THR ALA ALA GLY ILE THR HIS GLY MET ASP GLU LEU TYR LYS

ATGTCCAGACTTCCAATTGACACTAAAGGGATCCGAATTC - 3'

Fig. 3

WO 97/11094 PCT/DK96/00051

DNA and predicted primary amino acid sequence of F64L-GFP
(Hind3-EcoRI fragment).

5' - AAGCTTT

ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT GAA TTA GAT GGC
MET SER LYS GLY GLU GLU LEU PHE THR GLY VAL VAL PRO ILE LEU VAL GLU LEU ASP GLY

GAT GTT AAT GGG CAA AAA TTC TCT GTT AGT GGA GAG GGT GAA GGT GAT GCA ACA TAC GGA
ASP VAL ASN GLY GLN LYS PHE SER VAL SER GLY GLU GLY GLU GLY ASP ALA THR TYR GLY

AAA CTT ACC CTT AAA TTT ATT TGC ACT ACT GGG AAG CTA CCT GTT CCA TGG CCA ACG CTT
LYS LEU THR LEU LYS PHE ILE CYS THR THR GLY LYS LEU PRO VAL PRO TRP PRO THR LEU

GTC ACT ACT CTC TCT TAT GGT GTT CAA TGC TTT TCT AGA TAC CCA GAT CAT ATG AAA CAG
VAL THR THR LEU SER TYR GLY VAL GLN CYS PHE SER ARG TYR PRO ASP HIS MET LYS GLN

CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG GAA AGA ACT ATA TTT TAC
HIS ASP PHE PHE LYS SER ALA MET PRO GLU GLY TYR VAL GLN GLU ARG THR ILE PHE TYR

AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC AAG TTT GAA GGT GAT ACC CTT GTT
LYS ASP ASP GLY ASN TYR LYS THR ARG ALA GLU VAL LYS PHE GLU GLY ASP THR LEU VAL

AAT AGA ATC GAG TTA AAA GGT ATT GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA
ASN ARG ILE GLU LEU LYS GLY ILE ASP PHE LYS GLU ASP GLY ASN ILE LEU GLY HIS LYS

ATG GAA TAC AAT TAT AAC TCA CAT AAT GTA TAC ATC ATG GCA GAC AAA CCA AAG AAT GGC
MET GLU TYR ASN TYR ASN SER HIS ASN VAL TYR ILE MET ALA ASP LYS PRO LYS ASN GLY

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT AAA GAT GGA AGC GTT CAA TTA GCA GAC
ILE LYS VAL ASN PHE LYS ILE ARG HIS ASN ILE LYS ASP GLY SER VAL GLN LEU ALA ASP

CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT GTC CTT TTA CCA GAC AAC CAT TAC
HIS TYR GLN GLN ASN THR PRO ILE GLY ASP GLY PRO VAL LEU LEU PRO ASP ASN HIS TYR

CTG TCC ACG CAA TCT GCC CTT TCC AAA GAT CCC AAC GAA AAG AGA GAT CAC ATG ATC CTT
LEU SER THR GLN SER ALA LEU SER LYS ASP PRO ASN GLU LYS ARG ASP HIS MET ILE LEU

CTT GAG TTT GTA ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA TAA
LEU GLU PHE VAL THR ALA ALA GLY ILE THR HIS GLY MET ASP GLU LEU TYR LYS

ATGTCCAGACTTCCAATTGACACTAAAGGGATCCGAATTC - 3'

Fig. 4

DNA and predicted primary amino acid sequence of F64L-S65T-GFP
(Hind3-EcoRI fragment).

5' - AAGCTTT

ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT CTT GTT GAA TTA GAT GGC
MET SER LYS GLY GLU GLU LEU PHE THR GLY VAL VAL PRO ILE LEU VAL GLU LEU ASP GLY

GAT GTT AAT GGG CAA AAA TTC TCT GTT AGT GGA GAG GGT GAA GGT GAT GCA ACA TAC GGA
ASP VAL ASN GLY GLN LYS PHE SER VAL SER GLY GLU GLY GLU GLY ASP ALA THR TYR GLY

AAA CTT ACC CTT AAA TTT ATT TGC ACT ACT GGG AAG CTA CCT GTT CCA TGG CCA ACG CTT
LYS LEU THR LEU LYS PHE ILE CYS THR THR GLY LYS LEU PRO VAL PRO TRP PRO THR LEU

GTC ACT ACT CTC ACT TAT GGT GTT CAA TGC TTT TCT AGA TAC CCA GAT CAT ATG AAA CAG
VAL THR THR LEU THR TYR GLY VAL GLN CYS PHE SER ARG TYR PRO ASP HIS MET LYS GLN

CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG GAA AGA ACT ATA TTT TAC
HIS ASP PHE PHE LYS SER ALA MET PRO GLU GLY TYR VAL GLN GLU ARG THR ILE PHE TYR

AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC AAG TTT GAA GGT GAT ACC CTT GTT
LYS ASP ASP GLY ASN TYR LYS THR ARG ALA GLU VAL LYS PHE GLU GLY ASP THR LEU VAL

AAT AGA ATC GAG TTA AAA GGT ATT GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA
ASN ARG ILE GLU LEU LYS GLY ILE ASP PHE LYS GLU ASP GLY ASN ILE LEU GLY HIS LYS

ATG GAA TAC AAT TAT AAC TCA CAT AAT GTA TAC ATC ATG GCA GAC AAA CCA AAG AAT GGC
MET GLU TYR ASN TYR ASN SER HIS ASN VAL TYR ILE MET ALA ASP LYS PRO LYS ASN GLY

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT AAA GAT GGA AGC GTT CAA TTA GCA GAC
ILE LYS VAL ASN PHE LYS ILE ARG HIS ASN ILE LYS ASP GLY SER VAL GLN LEU ALA ASP

CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT GTC CTT TTA CCA GAC AAC CAT TAC
HIS TYR GLN GLN ASN THR PRO ILE GLY ASP GLY PRO VAL LEU LEU PRO ASP ASN HIS TYR

CTG TCC ACG CAA TCT GCC CTT TCC AAA GAT CCC AAC GAA AAG AGA GAT CAC ATG ATC CTT
LEU SER THR GLN SER ALA LEU SER LYS ASP PRO ASN GLU LYS ARG ASP HIS MET ILE LEU

CTT GAG TTT GTA ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA TAA
LEU GLU PHE VAL THR ALA ALA GLY ILE THR HIS GLY MET ASP GLU LEU TYR LYS

ATGTCCAGACTTCCAATTGACACTAAAGGGATCCGAATTC - 3'